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Abstract 

We consider the goodness of fit testing problem for linear stochas- 

r__| ■ tic differential equation (Ornstein-Uhlenbeck process). The basic hy- 

C/3 ! pothesis is supposed to be composite with two-dimensional unknown 

r*j ' parameter. We study two goodness of fit tests of Cramer-von Mises 

type based on empirical distribution function and on local time esti- 
mator of the invariant density. It is shown that the limit distributions 
of the underlying statistics under hypothesis do not depend on the 
unknown parameter. 
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1 Introduction 



S^ We consider the construction of the goodness-of-Gt (GoF) tests for stochastic 

differential equation 

dX t = S(X t ) dt + adWt, X , 0<t<T (1) 

We suppose that under basic hypothesis we observe an Ornstein-Uhlenbeck 
(O-U) process 

dX t = -p (X t -a) dt + a dW t , X , 0<t<T (2) 

where d = (a, j3) e •& is unknown parameter. The set = (01,02) x (01,^2) 
and b\ > 0. 



Remind that in the case of i.i.d. observations with distribution F(ft,x) 
depending on unknown parameter ft the Cramer-von Mises test is based on 
the statistics 



n 



F n (x)-F[ft n ,x 



dF ft n ,x 



and its limit distribution depends of the value ft. There are at least two 
particular cases when this limit distribution does not depend on ft : in the 
case of shift and scale parameters. The distributions of the Cramer-von 
Mises type statistics with estimated parameters were studied by Darling [2], 
Durbin [5] and Kac et al. [7]. We consider the similar situation in the case 
of observations of O-U process fl2]). 

In our case we propose two Cramer-von Mises type statistics similar to 
A n . At particularly, we study the statistics like 



A 



T 



T 



F T (x) -F[ft T ,x 



dF[ft T ,x 



(3) 



where Ft (x) is empirical distribution function and $t is the MLE. Remind 
that the invariant distribution F (ft, x) of O-U process §2§ is N ( a, |g J and 
the MLE is asymptotically normal 

VT (0 r - tf) =K (ft) ~ AT (0, 1 (ft)' 1 ) . 

Here I (ft) is the information matrix. It can be shown that under regularity 
conditions the statistic A^ has the following behaviour (as T — > oo) 



An 



T I F T (x) - F (ft, x) 



oo 

oo 



+VTlF(ft,x)-Flft T iX 

7] T (x)-VT( (ft T -#)f (ft, X 



dFl$ T ,x 

dF(ft T ,x 



r](ft,x)-U(ft),F(ft,x)) f(ft,x)dx = A 

where r) (ft, x) is a Gaussian process which admits the representation 

F (ft, x A y) - F (ft, y) F (ft, x) 



rj (ft,x) 



VTWv) 



dW(y). 



Here W (•) is two-sided Wiener process. The distribution of A in general 
case depends on the true (unknown) value $ and the choice of the threshold 
c e for the test i/jt = I{A T >c e } can be a difficult problem. The solution c £ of 
the equation P {A > c e } = s depends on $, i.e.; c £ = c £ (■#). Note that if we 
put c e = c e (^r) then we have to prove the continuity of this function. 

In the present work we study the statistics like A^ in the case of Ornstein- 
Uhlenbeck process ([2]) and show that the limit distributions of these statistics 
do not depend on $. Such tests can be called Asymptotically Parameter Free 
(APF). Therefore the threshold c £ can be choosen before the experiment. 
The study of the goodness of fit problems for continuous time observations 
of the ergodic diffusion processes was discussed in the works Kutoyants [8], 
[9], Dachian and Kutoyants pQ, Negri and Nishiyama [12]. See as well jl] and 
[5] where the close questions were discussed. The case of ergodic diffusion 
process with shift parameter and AFP test was studied by Negri and Zhou 

D3I- 

2 Main result 

We consider two goodness of fit tests of the Cramer-von Mises type with 
statistics like ([3]) based on empirical distribution function and empirical den- 
sity estimators. Remind that the empirical distribution function 

F T {x) = ^J l {Xt<x} dt 

and empirical density estimator (local time estimator) 

It [x) = ^ ^ / sgn (X t - x) dX t 

are unbiased, consistent and asymptotically normal 

VT (P t (x) - F (0, x)) =► A/" (0, d F (tf, xf) , 
^T (/ t (x) - f (#, x)) =► Af (0, d f (^ x) 2 ) 

(see [8], Propositions 1.51 and 1.57). The corresponding Cramer-von Mises 
type statistics are 

d T (X t ) = a 2 T T f T (x) - f U T , x) * dF C& T , x 

J— oo 

3 



and 



A T (X T ) = (3 T T f F T (x) - F C& T , x) 2 dF( 

J — oo 



dF[&T,x). 



Here $t — [o-t^Pt) is the maximum likelihood estimator (MLE) oft?. 
Introduce Gaussian processes 



Cf(x) 



and 



(f(x) 



> Fo{y) - 1{y>x} -2 X+ (l-2 X *)y 

fo(y) 



2 F (y)F Q (x)-F Q (yAx) + 1 + ^ 



VMJ)dW(y) 



fa (y) fa (x) 



Vfa(yjdW(y). 



Here W (y) , y (E R is two-sided Wiener process. Define two constants c £ and 
d e by the equations 



( f (x) 2 fa (x) 3 dx > c e 



-oo 
oo 



e, 



(f {xf fa {xf dx> d £ \ =e 



P 

P 

The class of tests of asymptotic size 1 — e we denote as K, £ . 

Theorem 1 The tests 

4j t (x t ) = 1 {St>Ce} , (t>T {x T ) = I {AT>d£ } 

belong to K, £ . 

Proof. We start with St (X t ) . Let us introduce three stochastic processes 
f]T (•) , ia,T (•) and ^t (•) by the following relation 

^/T[f T {x)-f(^T,x)) 

= VT(f T (x)-f(0,xj)+VT(f(#,x)-f(#T,x 

= VT (x)-(VT($ T -#),f(^,x 



Vt (x) - Vf (or - a) ^fc - VT (& - 

^t (x) - £ a ,T (x) - £/3,r (x) + r T . 



5/3 



r T 



Here t]t (x) = vT f fr (x) — f (i9, x) ) etc. The function / (•) has continuous 

bounded derivatives, therefore for the term tt we have E$r T —¥ 0. 
Introduce as well the random process 



n — 



0<t<T. 



The equation (J2J) we can re-write as 

d ^ ( ^- a) =- ^ (Xt - a) d(/3t) + v ^dH/ f , X , 0<t<T 
a a 

and so we obtain the equation for Z t : 

dZ t = -Z t d (/3t) + y^dW t , Z , 0<t<T. 
If we put Y s = Z*, < s < T/3, then the equation for Y" s is 

dY s = -Y s ds + dw s , Y , 0<s< T/3, 

where w s = y/~j3W s /p, s > is a Wiener process. The invariant density of this 
process / (y) is TV (0, ~) . We have 



P . [ P , A P fVP(x-a) 



f(&,x) = \ — o ex H ^(x-a) [■ = \ — ^exp 

V 7Ta z I a z J V na 2 I V <~r 

cr V °" / V 71 " 

Therefore 



2" 



'J/ 


1 fi 


exp 

-V 2 


v 

-00 ' 


ixa 2 

- a ) 
e 



a 

dv = F ( ^ {y - a] 
-k \ a 

Our goal is to represent the statistics A^ and St as some integral type 
functionals of Y s ,0 < s < T/3. These represntations will alow us to show 
that the limit distributions of these statistics do not depend on $. 

Remind that local time estimator admits the representations : 

VT[) VT J Xo o*f{#,y) dy 



2/(0, a;) f 1 l {Xt>x} -F($,X, 



T Jo °fW,X t ) 



dW,. 



Hence we can write 

2/(0, Z) [ T l {Xt>x} -F($,X t 



Vt(x) 



'T Jo <rf(#,X t ) 

2/o (g) f T l { z t>z} - Fp (Z t ) 



■dW t + (1) 



[dZ t + Z t d(tp)] + o(l) 



2/o (z) f T ^l {Ys>z} -F (Y s ) 



VW J 
B T p{Y,z 
o 



ok (Y s ) 
fo(z) + o(l). 



[dY s + Y s ds] 



For convergence (1) see the proof of the Theorem 4.6 in [8]. The processes 
Ca,T (3;) and £p t T (x) can be written as 



ia,r (x) = (a T - a] 



da 



where 

df(#,x) 2(3 3 / 2 {x-a) 



da 



<7 3 \/tt 



exp <; -— (x - a) 



2/3 yffl(x-a) 



a 2 yfii a 



o* 



exp 



\//3 (x — a) 



a 



2/3 

-2 * /o (*) 



and 



^,t(x)= /3 T -/3 



0(3 



where 










df(#,x) 




/ /3 (x — a) 
V cr 2 7T cr 2 


8(3 


2v/7ra 2 /3 




1 


1 
2 ~~ 


/v^(x-a)\ 2 " 




s/aH(3 


c 


1 


1 
2 


-z 2 


/o (*) • 



exp <; -— [x - a) 



a* 



exp 



\/!3~{x — a) 



Further 



X T - X Q 1 / 



and 



h 



a T {X T -X )- j* X t dX, 
f Q T (X t - a T ) 2 dt 



Hence 



y/T(a T -a) = -^= f [dX t + p(X t -a)dt] + o(l) 

f3\/T Jo 



a 



^/ 2 Vr J0 

r T/3 



'■' A VP(x t - a) + VP(x t - a ) ) 



a 



a 



+ o(l) 



P- 



y= J [dY s + Y s ds] + o (1) = -M T0 (Y) + o (1) 



For the second estimator similary we obtain 



i r T P- 



,-,. n ^-^J^YsldYs + Ysds] 



1 CYs 2 ds 



rti 



Note that by the law of large numbers 



Hence 



l r TIB i 



2 r Tp 

T[{3 T -f3) = -V/3^= I Y s [dY s + Y s ds] + o (1) 



jrpj 

= -2y/pN Tf >(Y) + o(l). 
Finally we write 

Vt(/t(s)-/(£t,x)) 

= [S T/3 (y, z) - 2z M T/3 (y) + (1 - 2z 2 ) N TP (Y)] /o (z) + r T (z) 

Let us consider the vector 

(B Tp - (y, Zl ),..., b t ? (y, * fc ) , m ti3 (Y) , n tp (y)) . 



For the linear combination of its component by the central limit theorem we 
have 



J2 ^iBt? (Y, zi) + A fc+1 M T/3 (Y) + X k+2 N Tp (Y) 



1=1 



*Tp 



T3 






1=1 



fo(Y s 



dw s 



„Tp fc+2 



-1 pip "-T^ 

== / V A,*, (Y s , zi) dw s => M (0, D) 

1 P Jo i=i 



where 



fe+2 fc+2 



D = ^2Y1 XiXm / $ ' ^' **) $m ( v ' z ») /« fa) dv - 

Z=l m=l ^°° 

Further, we can write the following two estimates: for any L > and 
l^d , \z 2 \ < L (z 2 > Zi) we have 



E \B Tj3 (Y,z 2 )-B Tfi (Y,z 1 ) 

4 /^ i {Ys>Z2} -f q o:) u „ ,,.- ■/■;,(>;) 



/o (r.: 



/o (^) 



d.s 



"-2 



dt> 
i £fa) 



< Ay/ne L \z 2 — Z\\ = C \z 2 — zA . 



and for any k > there exist a constant L > 0, which does not depend on 
T, such that 

/ E [B Tp (Y, z) - 2z M Tp (Y) + (l - 2z 2 ) N Tp (Y)] 2 /„ (zf dz < k. 

J\z\>L 

Hence (see [5J, Theorem 9.7.1) 

/oo 
[B Tp (Y,z) + zM Tp (Y) - (1 - 2z 2 ) N Tp (Y)] 2 /„ {zf dz + o(l) 
-oo 

C/(^) 2 /oW 3 dz. 

Therefore i/jt & IC £ . 

For the second statistics A^ (^ T ) we have the similar relations. The 
empirical distribution function has the representation 



8 



t(f t(x) -f ( ^)) = ^J^ - 37 ^- 5 d, 

F($,xA X t ) - F (#, x) F (#, X t ) 



<Tf(#,X t ) 



dW t , 



and we can write 



F (#, x) F (■&, X t ) - F (#, x A X., 



TJ <jf{$,Xt> 

2 f Tp F (z) F (Y s ) -F (zA Y s 



^dw t 



/o (Y a ) 



[dY s + Y s ds] 



A T p{Y,z) 



Further (z = \f^o 1 (x — a)) 



T[F[& T ,x)-F(&,x 

dF($,x 



T (&t — & 

1 



da 



t(*-/?)^!^+o(i; 



Remind that 



Finally, 



M T/3 (Y) /o (z) - ^=iV T/3 (y) / (*) + o(l). 



f=l + o(l>. 



/oo 
[A T/3 (y *) + m T/3 (y) + ^ (y)] 2 / (^) 3 d^ + o (i) 
-oo 

Cf (a:) 2 Jo (^) 3 dx. 



Therefore the test <p T (X T ) is APF. 

Note that the tests of Kolmogorov-Smirnov type have the similar APF 
property. Indeed, let us introduce the statistics 

5 T (X T ) = sup VT\f T (x)-f (tf T , x) | a, 
A T (X t ) = sup Vt\f t (x)-F (# t , x) | \/At- 



It can be shown that under hypothesis these statistics have the limits 
S T (A t ) =}► sup \(f (x)\ f (x) , A T {X T ) =}► sup \( F (x)\ f (x) 

X X 

The weak convergence of the corresponding stochastic processes can be proved 
using the techniques developed in Kutoyants [8J, Theorem 4.13 and in Negri 
[IT] respectively. 
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